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NUCLEIC ACID AND ALLERGENIC 
POLYPEPTIDES ENCODED THEREBY 
IN CASHEW NUTS (Anacardium occidentale) 



Related Application 
This application claims priority from co-pending U.S. provisional application 
Serial No. 60/423,556, which was filed on November 4, 2002, and which is 
incorporated herein by reference in its entirety. 

Field Of The Invention 

The present invention relates generally to the field of cell biology and, more 
particularly, to a nucleic acid sequence and the polypeptides encoded thereby which 
contain an allergen found in cashew nuts. 



Background Of The Invention 

Allergic reactions to cashew nuts {Anacardium occidentale), though relatively 
infrequent, can be varied and even life threatening reactions include contact or 
systemic dermatitis (to cardol and anacardic acid found In the cashew nut shell 

20 oil)(Marks et al. 1984). atopic dermatitis, and IgE-mediated systemic allergic 
reactions.(Burks et al. 1998; Tariq et al. 1996; Garcia et al. 2000) Pistachios and 
mangos are other edible members of the Anacardiaceae family and are allergenic 
with pistachio showing extensive in vitro and possible clinical cross-reactivity with 
cashews. (Fernandez, Fiandor, Martinez-Garate, and Martinez 1995) (Pan-a et al. 

25 1993) (Querela, Rafanelli, Marsigli, Foschi, and Stefanini 1999). Cashew nuts are 
widely used in snack foods and as an ingredient in a variety of processed foods such 
as "butters", bakery and confectionery products. 

We have recently demonstrated that the major IgE-reactive proteins in 
extracts of cashew nut are legumin-like proteins and 2S albumins as assessed by 

30 N-temiinal and enzymatic fragment sequencing of native proteins. (Teuber, Sathe, 
Peterson, and Roux 2002) We have also constructed a cashew nut cDNA library 
and, upon initial screening, cloned and sequenced an allergen in the 7S superfamily, 
which includes vicilin-like and sucrose binding proteins. (Wang etal. 2002) Here, we 
present the sequence and characteristics of a second cDNA encoding protein. 
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designated Ana o 2, which is a member of the legumin (1 1 S globulin) family of seed 
storage proteins. We also describe its reaction with sera from cashew-allergic 
patients and map its linear epitopes. 
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Summary Of The Invention 

With the foregoing in mind, the present invention describes a cloned, 
expressed and characterized a major cashew allergen. A cashew cDNA library was 
screened with human IgE and rabbit IgG anti-cashew extract antisera, and a reactive 
non-vicilin clone, sequenced and expressed as a fusion protein in E. coll. 
Immunoblotting was used to screen for reactivity with patient's sera and inhibition of 
immunoblottiiig was used to identify the corresponding native peptides In cashew nut 
extract. The identified allergen was subjected to linear epitope mapping using the 
SPOTS solid phase synthetic peptide technology. 

Sequence analysis showed the selected clone, designated Ana o 2, to be a 
member of the legumin family (an 11S globulin) of seed storage proteins. By 
immunoblotting, 1 3 of 21 (62%) of sera from cashew allergic patients were reactive. 
Immunoblot inhibition data showed the native Ana o 2 constitutes a major band at 
-33 kD and a minor band at -55 kD. Probing of overlapping synthetic peptides with 
pooled human cashew-allergic sera identified 22 reactive peptides, 7 of which gave 
strong signals. Several Ana o 2 epitopes were shown to overlap those of the peanut 
allergenic legumin, Ara h 3, in position but with little sequence similarity. Greater 
positional overlap and identity was observed between Ana o 2 and soybean glycinin 
epitopes. Conclusion: We conclude that this legumin-like pirofein is a major allergen 
in cashew nuts. 

Brief Description Of The Drawings 

Some of the features, advantages, and benefits of the present Invention 
having been stated, others will become apparent as the description proceeds when 
taken in conjunction with the accompanying drawings in which: 
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FIG 1 shows the nucleotide and derived amino acid sequence analysis of Ana 
o 2 cDNA clone Ana o 2; this figure includes subfigures A1 and 2. B, C, and D, ail of 
which may be collectively refen-ed to as FIG. 1, and wherein 

FIGS. 1 A1 and 1A2, respectively, show the nucleotide sequence 
encoding for the polypeptide of SEQ ID NO:1, referred to as Ana-o-2. including a 
stop codon shown underlined and in bold; and the polypeptide of SEQ ID NO:2, 
which is a tail for the polypeptide of SEQ ID NO:1; 

FIG. 1 B shows the derived amino acid sequence of the Ana o 2 coding 
region (SEQ ID NO:1) shown with the presumed signal sequence underlined, sites 
of sequence insertions shown in bold text, deletions shown in double underline in the 
nine most similar sequences in GenBank (see Table I) as compared to Ana o 2 
indicated; sequences similar to previously reported cashew legumin group proteins 
derived by amino acid sequencing, (Teuber 2002) are shown in bold italics; 

FIG. 1 C shows details on the sites of insertion (red) and 
deletions(blue) as described herein; a vertical line between residues indicates the 
sites of insertions in similar peptides with the insertion size (number of amino acids) 
and plant species (two or three letter code) shown above the insertion site; deletions 
are underiined or overlined, and only deletions and insertions of three or more 
residues are included; abbreviations used are as follows: Rc, Ricinus communis; 
Cav, Corylus avellana; Qr, Quercus robur, Ahy. Amaranthus hypochondriacus; Si. 
Sesamum indicum; Ms, Magnolia salicifolia; Vs, Vicia sativa; Cp, Cucurbita pepo; 
Car, Coffea arabica; Ah3, Arachis hypogaea (Ara h 3); and Ah4, Aractiis tiypogaea 
(Ara h 4); and 

FIG. 1 D shows a comparison of cDNA-derived Ana o 2 sequences to 
similar sequences of previously reported cashew legumin group proteins derived by 
amino acid sequencing. (Teuber et al. 2002) Identical residues are in bold. 
Accession number: AF453947; 

FIG 2 shows the identification of native Ana o 2 peptide in cashew extract by 
means of Western blotting, wherein nitrocellulose blots of total cashew extract (A) 
and rAna o 2 (B) were probed with serum firom cashew-allergic patient (#9) and 
tagged with 1251-labeled anti human IgE; the first (left) lane in each blot was 
incubated without Inhibition; the second through fourth lanes were incubated with 
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rAna o 2, rAna o 1 and/or MBP as indicated; the right lanes were incubated with 
control human sera; arrows indicate locations of native Ana o 1 , top, Ana o 2 (acidic 
subunit), bottom, and an, as yet, unidentified band, middle; the lower and upper 
bands in (B) representing the uncleaved (precursor) fonn of rAna o 2 in its monomer 
and presumptive dimer configurations, respectively; and 

FIG 3 shows a comparison of the IgE-binding epitopes of various 11S 
globulins to those of the cashew legumin. Ana o 2; bracketed and numbered regions 
indicate locations of the reactive peptides In Ana o 2 (also listed in TABLE III); 
shaded segments indicate regions expressing one or more linear epitopes on the 
different allergens; identical amino acids are indicated with a ""while similar residues 
are depicted wnth a "1"; the symbol "f shows where a portion of a sequence was 
deleted and a "-" represents where a space was added in the sequence to maximize 
alignment; the presumed cleavage site between the acidic and basic subunits 
(between amino acids 271 and 272) is shown in text above the sequence; boxes 
denote regions where numerous insertions and deletions are found when comparing 
multiple legumin sequences (see Fig 1C). 

Detailed Description of the Preferred Embodiment 

The present invention will now be described more fully hereinafter with 
reference to the accompanying drawings, in which preferred embodiments of the 
invention are shown. Unless othenwise defined, all technical and scientific terms 
used herein have the same meaning as commonly understood by one of ordinary 
skill in the art to which this invention pertains. Although methods and materials 
similar or equivalent to those described herein can be used in the practice or testing 
of the present invention, suitable methods and materials are described below. All 
publications, patent applications, patents, and other references mentioned herein are 
incorporated by reference in their entirety. In case of conflict, the present 
specification, including any definitions, will control. In addition, the materials, 
methods and examples given are illustrative in nature only and not intended to be 
limiting. Accordingly, this invention may be embodied in many different forms and 
should not be constmed as limited to the illustrated embodiments set forth herein. 
Moreover, these illustrated embodiments are provided so that this disclosure will be 
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thorough and complete, and will fully convey the scope of the invention to those 
skilled In the art. Other features and advantages of the invention will be apparent 
from the following detailed description, and from the claims. 

Abbreviations Used 

Amino acids: the standard one-letter abbreviations are used. 

DTT: dithiothreitol. 

HRP: horseradish peroxidase. 

Nucleotides: the standard one-letter abbreviations are used. 
PGR: polymerase chain reaction. 
PBS: phosphate buffered saline. 

DS-PAGE: sodium dodecyl sulfate polyacrylamide gel electrophoresis. 
TBS: tris-buffered saline. 

Methods 

Human sera. 

Blood samples were drawn after informed consent from patients with 
life-threatening systemic reactions to cashew nut and the sera frozen at -70*C until 
use. The study was approved by the human subjects review committee of the 
University of California at Davis. The presence of cashew-reactive IgE was 
confinned by Phamiacia ImmunoCAP assay (Phamriacia, Uppsala, Sweden) or by 
Westem immunoblotting. Control sera were obtained from patients with a history of 
anaphylaxis to walnut, pistachio or hazelnut who reported tolerance of cashews. 

Cashew protein extract 

An albumin/globulin extract was prepared as previously described. Protein 
concentrations were measured by use of the Bradford protein assay (BioRad 
Laboratories, Inc., Hercules, CA) using bovine serum albumin as the standard 
protein. 

Production of rabbit polyclonal antiserum. 

A rabbit was immunized with 5 mg of cashew extract in Freund's complete 
adjuvant and boosted four weeks later with 5 mg of cashew extract in incomplete 
Freund's adjuvant. The rabbit was subsequently bled and the semm stored at 
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-ao^C. Guidelines for animal care and welfare as in the "Guide for the Care and Use 
of Laboratory Animals" prepared by the Institute of Laboratory Animal resources, 
National Research Council, National Academy Press, revised 1996, were followed. 
Construction and IgE immunoscreening of cashew cDNA library. 
Library construction and immunoscreening have previously been described 
in detail.(Wang 2002) Briefly, cashew nuts in late maturation were chopped, frozen, 
and ground. Total RNA was extracted in TRIzol (GIBCO BRL Life Technologies Inc.. 
Rockville, MD) and mRNA was isolated using a PolyATtract kit (Promega. Madison, 
Wl). The cDNA library was constructed using the Uni-ZAPXR Gigapack Cloning Kit 
(Stratagene Inc., Cedar Creek, TX) and cloned into the lambda Unl-ZAP XR 
expression vector. The library was amplified in E. coll strain XL1-Blue and screened 
with rabbit (IgG) and human (IgE) antisera. The immunopositive clones were picked, 
plaque-purified, and stored in SM buffer supplemented with 2% chloroform at 4'C. 
Sequencing and corresponding analysis of selected genes. 
Inserts from the selected phage clones were amplified with M13 fonward and 
reverse primers by PGR. Both strands of the PGR products were then sequenced 
on an ABI 3100 Genetic Analyzer (Foster City, CA) using capillary electrophoresis 
and Version 2 Big Dye Terminators as described by the manufacturer. Similarity 
searches and alignments of deduced amino acid sequences were performed on 
Genetics Computer Group (GCG) software (Accelrys, Inc. San Diego, CA) using the 
BLAST 2.0 program (www.ncbi.nlm.nih.gov/BLAST/). 

Cloning, expression and purification of cDNA-encoded proteins. 
As previously described in detail (Wang et al. 2002), cDNA coding sequences were 
modified by the addition of a Sal I site at 5' end and a Pst I site at 3'end by PCR 
using PfuTuriJO DNA polymerase (Stratagene Inc., Cedar Creek, TX) followed by 
digestion with Sal I and Pst I and ligation to maltose binding protein (MBP) fusion 
expression vector pMAL-c2 (New England BioLabs Inc., Beverly, MA), into which a 
thrombin cleavage site had been engineered. 

For expression, competent E. coll BL21 (DE3) cells (Novagen Inc., Madison, 
Wl) were transfomned. Positive clones were identified by PCR screening using 
cDNA sequence-specific primers. Single colonies were grown and induced with 
isopropyl-D-thiogalactopyranoside (IPTG). The cells were harvested, lysed with mild 
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sonication. and centrifuged at 12,000g. The lysate supernatant was passed over an 
amylose affinity column and the fusion protein eluted with 10 miy/I maltose. The 
cDNA insert was produced by PGR amplification of the portion of the sequence 
extending from the presumed first codon following the leader peptide through the last 
codon prior the stop codon using the primers 
TCTAGAGTCGACCGCCAGGAATGGCAAC (forward) and 
GCTTGCCTGCAGTTAAGCATCATCCCTCATG (reverse). Fusion proteins were 
cleaved with thrombin (Sigma. St. Louis. MO) and the liberated recombinant protein 
purified on a Superdex 200 column (Amersham Phamiacia, Piscataway. N J) by high 
performance liquid chromatography (HPLC). Cleaved and un-cleaved recombinant 
proteins were concentrated and either stored (briefly) at 4oC until use or frozen at 
-70oC. 

Gel electrophoresis (SDS-PAGE), immunoblotting, and inhibition. 

r Ana 0 2 (at 10 g/4mm width) or total cashew extract (100 g/4mm width) 
samples were boiled in reducing sample buffer and subjected to electrophoresis, 
immunoblotting and inhibition as previously described (Wang et al. 2002). For 
inhibition experiments, 10 g of rAna o 2, rAna o 1 , and/or MBP were pre-incubated 
with human antiserum (from patient #9 at 1:20 dilution) for 1 hr at 37o C and then 
incubated with nitrocellulose strips containing blotted cashew extract overnight at 4o 
C. The strips were then incubated with the appropriate labeled second antibody and 
exposed to x-ray film. 

Solid-phase peptide (SPOTs) synthesis and binding to IgE. 

Based on the derived amino acid sequence of the 457 amino acid Ana o 2 protein 
(including the presumptive leader sequence, amino acids 1-14). 57 overiapping 
15-amino acid peptides, each offset by eight amino acids, were synthesized. An 
additional peptide, corresponding to presumptive N-terminal amino acid, 15-29, was 
also produced. Peptides were synthesized on derivatized cellulose sheets using 
9-fluoreniymethoxy carbonyl-derived (Fmoc) amino acids (Genosys Biotechnologies, 
Inc., The Woodlands, TX) and probed as previously described (Wang et al. 2002). 
Briefly, the peptide-containing membranes were washed in TBS and incubated o/n 
at RT in blocking solution, washed in TBS-T and incubated o/n at 4 'C with pooled 
patients' sera diluted 1 :5 (v:v). This incubation was followed by three 5-min washes 
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in TBS-T and an o/n (4 'C) incubation with 1251-anti-human IgE (Hycor Biomedical 
Inc.) diluted 1:10 in a mixture of PBS, 5% nonfat dry milk, and 0.05% Tween-20. 
After additional washes, IgE-peptide reactivity was identified after a 1 -week exposure 
at -70°C to x-ray film or a 17 hr incubation with a Molecular Dynamics 
phosphoimaging screen (Kodak). Phosphoimaging data was quantified using a 
Storni 860 scanner (Molecular Dynamics) and its accompanying software. 

Results 

cDNA library screening and gene cliaracterization. 

The cDNA library was initially screened for reactivity with both rabbit IgG and 
(cashew-allergic) human IgE. As previously reported, a total of 50 clones displaying 
dual reactivity were selected for further analysis (Wang et al. 2002). The first four 
clones selected for sequencing all proved to encode one of two vicilin variants and 
were designated Ana o 1.1 and Ana o 1.2. A further analysis of the remaining 46 
clones by PGR using Ana o 1 specific primers showed that 45 of the 46 were Ana o 
1 -positive. The remaining clone, upon sequencing and comparison to GenBank, 
proved to be homologous with the legumin family of seed storage proteins (Fig 1A) 
and was designated, Ana o 2. Table I lists the nine proteins having the highest 
degrees of identity (46% to 58%) and similarity (63% to 74%) to Ana o 2 and include 
proteins described as legumin-like, legumin precursor, legumin A precursor, IIS 
globulin, and IIS globulin b-subunit precursor, representing a diversity of tree and 
plant species (Table I). Also listed are four 1 1 S family proteins with somewhat lower 
identity (42%-47%) and similarity (58%-66%) but which have previously been 
Identified as legumin allergens. Two of the allergens are from the peanut, Ara h 3 
and Ara h 4, and two from soybean, the glycinin G1 and G2 subunits. 

Protein sequence cliaracterization. 

Analysis of the nucleotide (Fig 1A) and deduced amino acid sequences (Fig 
1 B) reveal a 457 amino acid open reading frame, a possible start site at position 93, 
and possible leader peptide from 1 to 15. Comparison of the nine most similar 
sequences and the four above mentioned allergens (listed in Table I) with Ana o 2 
amino acid sequence shows clustered sites of insertions (red segments) and 
deletions (blue segments). Figure 1C shows the sites of insertions (blue over and 
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underlined) and deletions (red vertical lines) of four or more amino acids in the 
eleven compared sequences with respect to the Ana o 1 sequence. 
Multiple insertions are seen bet\«een amino acid 14 and 23, 104 and 121, and 180 
and 1 98 with additional scattered insertions between 247 and 271 . In contrast to the 
24 insertion, only four deletions are present, all of which are clustered In the 245 to 
270 region. 

We have previously reported N-tennlnal and tryptic peptide sequences for 
several native IgE reactive cashew proteins (Teuber et al. 2002). Four of these were 
found to have similarity to the cDNA sequence of Ana o 2 (Fig ID and green 
sequence segments in Fig 1B) though none was identical suggesting that Ana o 2 
is a member of a larger gene family or that there is extensive polymorphism at the 
relevant loci. 

Reactivity of the recombinant protein witli human IgE and rabbit igG. 

For immunological characterization, we cloned and expressed Ana o 2 
beginning at nucleotide 47 (after the presumptive leader peptide) through 1 375. The 
DNA segment was ligated into an expression vector designed to allow for purification 
of the recombinant molecule via a mannose-binding protein (MBP) fusion domain in 
conjunction with an amylose affinity column and a thrombin-specific cleavage site. 
The resulting -93 kD Ana o 2 fusion protein was affinity purified as previously 
described and digested with thrombin to yield a -52 kD peptide as well as the 43 kD 
MBP (data not shown). Both cleaved and uncleaved peptides were reactive with 
specific human IgE and rabbit IgG (data not shown). 

Recognition of Ana o 2 as an allergen. 

The prevalence of reactivity to rAna o 2 among cashew allergic patients by 
Western immunoblotting is shown In Table 2. IgE from 13 of 21 (62%) sera from 
patients with a history of life-threatening reactions to cashews bound the 
recombinant. In 6 cases, the intensity of the signal was strong, but was weak in 7 
cases. In contrast, 1 of 10 sera from patients tolerant of cashew but clinically with 
life-threatening reactions to other tree nuts reacted with the rAna o 2. 

Identification of native Ana o 2 by SDS-PAGE immunoblotting. 

Having established that Ana o 2 is a major cashew allergen, we next sought 
to identify the band or bands in a typical total cashew immunoblot that correspond 
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to the cloned polypeptide storage protein precursor. Figure 2A shows nitrocellulose 
strips in which the total soluble cashew extract has been blotted and probed with 
semm from an allergic patient #9. The first strip (left lane) shows this serum reacting 
with three major bands at -55, -36 and -33 kD. The second through fourth lanes 
were incubated with the same human sera preincubated with rAna o 2, rAna o 1 + 
rAna o 2, and the fusion protein, MBP, respectively. Note that Ana o 2 inhibits the 
strong band at -33 kD and weakens the band at -55 kD. The addition of Ana o 1 
inhibits the residual signal at -55 kD. MBP has no inhibitory effect as expected. The 
remaining band at -36 kD is as yet unidentified. The last lane (right) shows no 
reactivity with a patient having allergies to agents other than cashew. 

Figure 2B shows reactivity of the human sera with rAna o 2 (-53 kD) and its 
inhibition with purified rAna o 2 but not rAna o 1 nor MBP. The band at -120 kD is 
presumed to be a dimer of rAna o 2. 

Identification and recognition of IgE-reactive linear epitopes on Ana o 2 

and comparison to peanut and soybean legumin epitopes. 

The entire amino acid length of Ana o 2 was studied by probing 58 
overiapping solid phase synthetic peptides with sera from 12 patients randomly 
assigned to three pools. Collectively, the three pools reacted weakly 
(phosphoimaging score from 2.0-3.9 x 10-3) with 12, moderately (4.0-6.9 x 10-3) 
with three, and strongly ( 7.0 x 10-3) with seven linear IgE-binding epitopes. The 22 
reactive epitopes were distributed throughout the length of the protein (Table III) with 
68% (15/22) of all epitopes residing on the presumed acidic chain of the protein, 
including 86% (6^) of those epitopes that were found to react strongly with pooled 
patient IgE. Only two of the identified epitopes were bound by patients' sera from all 
three pools. Epitope #6 was bound strongly by pools 1 and 3, and moderately by 
pool 2. Epitope #3 was bound strongly by pool 1 , but moderately by pools 2 and 3. 

To compare the linear epitopes of cashew legumin (Ana o 2) to those of 
peanut (Ara h 3)(Rabjohn et al. 1999) and the soybean legumins (G2 and G1 
glycinin) (Helm et al. 2000b) (Beardslee et al. 2000), we aligned the sequences using 
the BLAST 2.0 program and highlighted the corresponding linear epitopes (Fig. 3). 
Of the regions included in the BLAST alignment, only two of the 22 Ana o 2 linear 
epitope-bearing peptides (#1 3 and 15) showed significant (>7 amino acid) positional 
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overlap with any of the four previously identified peanut Ara h 3 epitope-bearing 
peptides (Fig 3) (Rabjohn et al. 1 999). Even among these two overlapping epitope 
regions, there was little identity or similarity between the Ana o 2 and Ara h 3 
(Rabjohn et al. 1999). A much greater degree of conrespondence was observed 
when the sequences of Ana o 2 and G2 glycinin were aligned; 9 of the 11 (89%) 
previously identified G2 glycinin epitopes showed significant positional overlap with 
the Ana o 2 epitopes. All 9 showed a significant degree of similarity ( 60%) and two 
were quite similar in sequence to Ana o 2 epitopes #1 (77%) and the adjacent pair, 
#1 0 and #1 1 (80%) (Fig 3). Two IgE-binding linear epitopes have been identified on 
G1 glycinin (Beardslee et al. 2000), both on the acidic chain. Sequence alignment 
revealed that only the second glycinin epitope displayed significant positional overlap 
and no significant identity with a cashew legumin epitope (#13). Interestingly, this 
epitope (#13) is the only one identified that displays positional overlap with an 
epitope in each of the other three compared IIS globulins (Fig 3) suggesting the 
possibility of a shared structural motif favoring immunogenicity. 
Connparison of rAna o 2 to native peptides. 

We have recently reported the N-terminal and tryptic peptide sequences of 
four legumin proteins bands (Teuber et al. 2002). In comparing these sequences to 
those of rAna o 2, we note that none are identical in sequence (Table IV). One 
tryptic peptide matched at 8 of 9 positions. Two other nearly identical tryptic 
peptides matched rAna o 2 at only 13 of 19 position and 12 of 18 positions. An N- 
terminal peptide matched at 7 or 8 of 1 0 positions beginning at residue 272 of rAna 
o2. Together, these data suggest that these represent a complex family of 
differentially processed proteins. 
Discussion 

Approximately 0.5% of the US population is believed to be allergic to tree nuts 
(Sicherer et al. 1999) and the data from a voluntary registry of peanut and tree nut 
allergic US patients shows 20% of those reporting allergy to tree nuts list sensitivity 
to cashews, the highest percentage for any tree nut. (Sicherer et al. 2001). In 
another study, restricted to pediatric patients (n=54) at Johns Hopkins (Baltimore, 
Maryland, USA), cashew reactivity ranked fourth (1 1 %) behind walnut (26%), almond 
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(13%), and, pecan (13%), perhaps reflecting a delayed dietary introduction of 
cashews relative to other niJts.(Ehrlich P. 1989) 

A number of plant, animal and fungal allergenic proteins have been identified 
by biochemical and molecular genetic means in recent years reflecting the hope that 
the detailed information provided by such studies will lead to the development of 
hypoallergenic foods, new therapeutic and diagnostic tools, and an increased 
understanding of the properties that render some proteins more allergenic than 
others.(Heiss et al. 1999; Valenta R et al. 1999; Bannon et al. 2001; Niederberger 
et al. 2001; Bhalla et al. 2001) 

Our previous screening of a cashew nut cDNA library with human 
cashew-allergic sera primarily yielded clones encoding an IgE-reactive vicilin-like 
protein, designated Ana o1 (Wang et al. 2002). Here we describe the product of the 
single identified clone encoding a legumin-family (IIS globulin) seed storage protein, 
designated Ana o 2. 

The legumins are large (300-450 kD), generally hexameric, non-glycosylated 
members of a complex family of proteins. Individual subunits are found both as 
intact precursor proteins (50-60 kD) and as processed acidic (30-40 kD) and basic 
(20 kD) subunits that generally remain covalently associated in seeds. In addition 
to Ana o 2, several other legumins have been identified as food allergens. These 
include Ara h 3 (Rabjohn et al. 1999) and Ara h 4 (Kleber-Janke et al. 1999) of the 
peanut, the soybean G1 and G2 glycinins (Zeece et al. 1999; Helm et al. 2000a; 
Beardslee et al. 2000), Cor a 9 of the hazelnut (Beyer et al. 2002), and possibly a 
legumin from buckwheat (Fujind et al. 2001 ; Yamada et al. 1 995). The legumins are 
related to each other structurally and are believed to share ancestors in common with 
vicilin-like 7S proteins. The vicilins, or 7S globulins, are also multimeric, digestion 
resistant, seed storage molecules found in many edible plants and are frequently 
allergenic. 

More than 50% of the sera from cashew allergic patients (13 of 21, 62%) 
reacted with rAna o 2 formally making this a major allergen. Six of the sera reacted 
strongly (29%). Because of the known complexity of the legumin family, and the 
differences observed between the single recombinant cashew legumin cDNA-derived 
sequence reported here and the sequenced cashew peptide fragments from 
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IgE-reactive proteins previous reported (Teuber et al. 2002) it would not be 
unexpected If additional reactivity to legumin epitopes not represented on our cloned 
and expressed version were yet to be discovered. Nevertheless, at least for the 
patient's serum tested, it appears that the recombinant protein fully inhibited 
reactivity to the native extracted protein suggesting that most or all members of the 
cashew legumin family share common allergenic epitopes. However, further 
investigations comparing additional cloned versions the legumins to each other and 
to highly purified native molecules for reactivity to human IgE will be needed to 
adequately detemiine If this Ana o 2 clone expresses the full range of cashew 
legumin allergenic epitopes. The unique IgE-binding pattems of each pool of sera 
to the Ana o 2 linear epitopes suggests a heterogeneity of recognition pattems 
between patients though some peptides were recognized by members of two or all 
three pools. 

Linear epitope mapping has shown that the soybean G1 glycinin acidic chain 
may share cross-reactive epitopes with the peanut Ara h 3 (Beardslee et al. 2000). 
Both epitopes mapped from the G1 acidic chain show a considerable degree of 
identity (60% and 70%) with the overlapping region of the homologous Ara h 3 
epitopes though differences were noted in the critical amino acids.(Beardslee et al. 
2000) It has been shown directly that the IgE-binding regions of the soybean G2 
glycinin also bind IgE from peanut sensitive patients [Helm 20001. Such findings are 
not surprising when one considers that both are members of the legume family and 
patients allergic to one have serum IgE antibodies that immunologically cross-react 
with other legumes. (Eigenmann et al. 1996). While no such direct analysis has 
been perfomiedto determine if there is cross-reactivity between our cashew legumin 
(Ana o 2) and that of peanut (Ara h 3) and soybean (G1 and G2 glycinin), the BLAST 
alignment of these protein sequences and subsequent analysis of the positional 
relationship and degree of homology between linear epitopes on each protein has 
been informative. The major findings are that there is considerable positional 
overlap and some shared sequence identify between Ana o 2 and soybean G2 
glycinin but little in common between Ana o 2 and peanut Ara h 3 linear epitopes. 
We have shown that the majority of Ana o 2 linear epitopes (68% overall and 83% 
of the most strongly reactive) are on the acidic chain of the protein. Previous studies 
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have also observed stronger reactivity with the acidic chain rather than the basic 
chain of allergenic legumins [Pederson 1989](Herian et al. 1990; Rabjohn et al. 
1999; Beardslee et al. 2000)[Helm 2000]. 

It is worth noting that the majority of Ana o 2 epitopes are located in the same 
areas of the protein sequence that is evolutionarily least conserved with respect to 
insertions and deletions among a variety of legumins (compare figure 2C with the 
acidic chain of Ana o 2 in figure 3); features that would further minimize 
cross-reactivity between plant legumins. It should also be bome in mind though that 
these various results represent only those reactions between a subset of legumin 
sequences in each of the compared plant seeds and the IgE from a limited set of 
patients. The inclusion of additional legumins and a broader pool of patient sera in 
future analyses could shed further light on these various relationships. 

Clinical infomiation regarding cross-reactivity between cashews and legumes 
has not been well documented, but based on our findings we can predict that while 
there may be a basis for cross-reactivity between cashew and soybean, it is unlikely 
that such cross-reactivity exists between cashew and peanut legumins, at least with 
regard to linear epitopes. Similarly, we previously reported that cross-reactivity 
between the vicilin linear epitopes of cashew (Ana o 1) and peanut (Ara h 1) would 
also be unlikely (Wang et al. 2002), observations which, together, further explains 
the lack of cross-reactivity between tree nut- and peanut-reactive patients' sera 
(Sicherer et al. 2001). 

To date, no common structural characteristics of linear-lgE epitopes have 
been identified (reviewed in (Bredehorst and David 2001)), but our finding that all 
four legumins analyzed share an epitope with, positional overlap, suggests that 
epitope mapping can provide important clues to the nature of allergenicity. 
Three-dimensional modeling of our cashew legumin is continuing and when 
compared to the models of G2 glycinin (Helm et al. 2000b) and other homologous 
allergens, may provide additional insights. 
The Various Aspects of the Claimed Invention. 

The present invention thus discloses isolated nucleic acid sequences, 
polypeptide products thereof, and associated methods. The skilled will recognize 
that the isolated nucleic acids will be useful at least when expressed in a suitable cell 
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or organism to produce the encoded polypeptides, which in turn may be employed 
in testing to identify patients allergic to cashew nuts. Furthennore, expression of the 
nucleic acid sequences of the present invention in a suitable cell may be useful in 
studying and characterizing gene function. 

Accordingly, the present invention includes an isolated nucleic acid sequence 
comprising the nucleotide sequence of SEQ ID NO:1, or a degenerate variant of 
SEQ ID NO:1. SEQ ID NO:1 is the coding sequence for the described Ana-o-2 
cashew antigen, including a stop codon "TAA" at positions 1372-1374. The nucleic 
acid sequence described may also comprise at least one mutation selected from a 
deletion, a substitution, and an addition. A preferred embodiment of the invention 
includes an expression vector comprising the nucleic acid sequence noted above 
operably linked to an expression control sequence, and a cultured cell comprising 
this vector, the cell, or a progeny of the cell, preferably being transfected with the 
vector, wherein the cell expresses a polypeptide encoded by said nucleic acid 
sequence. The nucleic acid of SEQ ID N0:1 may also further comprise additional 
nucleotides according to SEQ ID NO:2, which provides a tail for the coding 
sequence. Additionally, the invention further includes an isolated nucleic acid 
comprising a sequence that encodes a polypeptide comprising the amino acid 
sequence of SEQ ID NO:3. An additional aspect of the Invention includes a method 
of producing a protein by culturing the cell containing nucleic acid sequence SEQ ID 
NO:1 under conditions permitting expression of a polypeptide encoded by the nucleic 
acid. The method may also include purifying the polypeptide from the cell or the 
medium of the cell. 

The invention also discloses twenty-two (22) linear polypeptide epitopes, as 
listed in Table III, which are also respectively listed herein as SEQ ID NO:4 through 
SEQ ID NO:25. Accordingly, the invention includes an isolated nucleic acid 
comprising a sequence that encodes a polypeptide comprising an amino acid 
sequence selected from SEQ ID NOS:4-25. Additionally claimed is a purified Ig-E 
binding polypeptide the amino acid sequence of which comprises at least one 
sequence selected from SEQ ID NOS:4-25, and wherein the polypeptide comprises 
at least one mutation in said selected amino acid sequence, the mutation being 
selected from a deletion, a substitution, an addition, and particulariy a mutation which 
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reduces the polypeptide's IgE-binding capacity but retains at least some 
immunogenicity. Those skilled in the art should recognize that the scope of the 
invention includes an isolated nucleic acid sequence or a degenerate variant thereof, 
which encodes a polypeptide consisting essentially of an amino acid sequence 
selected from SEQ ID NOS:4-25, and that the selected amino acid sequence may 
include at least one mutation such as a deletion, a substitution, or an addition. It 
should be understood that by "essentially" it is meant that the polypeptide Is largely, 
but not wholly, the specified sequence. 

Further method aspects of the disclosed invention include a test for detecting 
a cashew allergy in a patient, said test comprising contacting the patient with an 
amount of a purified polypeptide comprising at least one amino acid sequence 
selected from SEQ ID NOS:3-15. In this test, contacting preferably comprises 
introducing the purified polypeptide into the patient, for example, by injection. The 
skilled will know that injecting includes any of the known routes useful in 
immunological testing of patients, for example, intradermally, and subcutaneously. 
Additionally, the purified polypeptide for use in such a test may comprise at least one 
mutation in said amino acid sequence, and the mutation may be selected from a 
deletion, a substitution, and an addition. 

Yet a further diagnostic test of the invention for detecting anti-cashew IgE In 
a patient to thereby indicate an allergy to cashews, the test comprising several steps. 
The patient's serum is reacted with a purified polypeptide the amino acid sequence 
of which comprises at least one sequence selected from SEQ ID NOS:4-25. The 
polypeptide Is separated from unreacted patient serum, and the polypeptide is 
reacted with a labeled human IgE-reactive agent after separating from unreacted 
patient serum. The polypeptide is then separated from unreacted labeled human 
IgE-reactive agent, and the labeled human IgE-reactive agent bound to the 
polypeptide after separating from unreacted agent is detected directly or indirectly 
to thereby indicate presence In the patient's serum of anti-cashew IgE. The amino 
acid sequence of the polypeptide for use in this test may also comprise at least one 
mutation selected from a deletion, a substitution, and an addition. 

The invention also includes a test for screening patients for allergy to cashews 
which may be embodied in a test kit comprising a first reagent containing at least one 
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purified polypeptide whose amino acid sequence comprises at least one sequence 
selected from SEQ ID NOS:4-25. The test kit preferably also includes a second 
reagent containing at least one labeled human IgE-reactive agent. Additionally, In 
the test kit the first reagent may include a solid phase carrying the purified 
5 polypeptide. 

The skilled will find additional guidance in carrying out the invention by 
consulting Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, Cold 
Spring Harbor Press, N.Y., and also Ausubel et al. (eds.), 1995, Cun-ent Protocols 
in Molecular Biology (John Wiley & Sons, N.Y.). In addition, guidance in diagnostic 

10 and screening test methods suitable for use with the polypeptides of the present 
invention may be found in Manual of Clinical Laboratory Immunology, Sixth Edition, 
Rose et al. (eds.), 2002, American Society for Microbiology, Washington, D.C. As 
noted above, these publications are incorporated herein in their entirety. 

Accordingly, in the drawings and specification, there have been disclosed 

15 typical prefen^ed embodiments of the invention, and although specific terms are 
employed, the terms are used in a descriptive sense only and not for purposes of 
limitation. The invention has been described in considerable detail with specific 
reference to these illustrated embodiments. It will be apparent, however, that various 
modifications and changes can be made within the spirit and scope of the invention 

20 as described in the foregoing specification and as defined in the appended claims. 



Sequence Listings 

Following is a written sequence listing for nucleic acid SEQ ID NOS: 1-2 and 
for polypeptides SEQ ID NOS:3-25. Also enclosed is a complete sequence listing 
25 in computer readable form. The information recorded on the form is identical to the 
written sequence listing. 



